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(54) Pipeline data processing apparatus and method for executing a plurality of data processes 
having a data-dependent relationship 



(57) A pipeline which allows one or more Through 
cycles to be assigned to pipeline stages is so consti- 
tuted that a stage in which a data process is executed 
does not remain in a fixed position but shifts to an arbi- 
trary stage. Consequently, even when a plurality of 
instructions to perform a plurality of data processes hav- 
ing a data-dependent relationship therebetween are 
issued simultaneously to a plurality of pipeline data 
processing circuits, the antecedent one of the data 
processes on which the subsequent one of the data 
processes is data-dependent is executed first by the 
pipeline processing circuit in charge of the antecedent 



data process and, at a point of time one cycle after the 
execution of the antecedent data process, the subse- 
quent data process which is data-dependent on the 
antecedent data process is executed by the pipeline 
data processing circuit in charge of the subsequent data 
process. As a result, a large numt>er of consecutive 
instructions can be issued without stagnation, so that a 
plurality of data processes having a data-dependent 
relationship therebetween can be executed at high 
speed. 



J»6 ^19 




JOG . . J01 . ^102 



Fig3 



1^ ,60 




102 

DATA READING STAGE 



OPERATION 
EXECUTING STAGE 



Through STAGE tH 



Throufih STAGE I 2 ) 



DATA WRITING STAGE 



99 



9i 




EP 0 730 223 A1 



Description 

BACKGROUND OF THE INVENTION • ' . 

The present invention relates to an improved pipe- 
line data processing apparatus and method for simulta- 
..neously executing a plurality of data processes by 
pipeline processing. , 

Recent years have seen remarkable improvements 
.in the performance, of microprocessors, to which 
improved clock frequencies and internal parallel 
processing have greatly/ contributed, the form.er 
Jmproyed clock frequencies may, be attributed Jo Jhe 
advancement of VLSI (Very-Large-Scale Integrated Cir- 
cuit), technology represented,. by processing technpipgy 
and high-speed circuit technology, while the latter inter- 
nal parallel processing may be attributed mostly to the 
advancement of architecture technology represented by 
pipelining technology and sparse color technology. 

In the internal parallel processing, the above-men- 
tioned pipelining technology is temporal paralle! 
processing technology whereby a sequence of . data 
processes are divided into a plurality of stages (pipe- 
liried) so that the plurality of .data processes are per- 
formed in an overlapping manner. On the other hand, 
the sparse color technology is spatial parallel process- 
ing technology whereby a plurality of data processes 
are perbrmed sinriultaneously in parallel.. By usjng 
these two technologies in combination, the improve- 
ments in the performance of microprocessors have 
been achieved. 

Referring now to FIG. 1 7, a desaiption will be given 
to an example of the structure for spatially as well as 
temporally performing a plurality of data processes in 
parallel, which is obtained by applying the foregoing 
sparse color technology to a conventional pipeline data 
processing apparatus for.tennporally performing a plu- 
rality of data processes in parallel. 

. . In the drawing are shown pipeline data processing 
circuits 150, 151, and 152 an-anged in parallel to each 
other. The parallel arrangement enables the pipeline 
data processing circuit 150, 151, and 152 to spatially 
process three sets of data. Since the pipeline data 
processing circuits 150 to 152 have the same structure, 
a description will be given only to the internal, structure 
of the pipeline data processing circuit 150 as a repre- 
sentative of the three circuits. In the pipelirie data 
processing circuit 150, data is inputted to each of first 
and second input ports 118 and 119, first and second 
registers 120. and 121 store, respective data inputted.to 
the above input ports 118 and.1 19, a 2-input adder/sub- 
tracter 130 receives the respective data stored inJhe 
two registers. 120 and 121 so as to periform . addition .or 
subtraction with respect to .tfie respective data,, and a 
register,.132 receives an operation .result 131 from the 
adder/subtracter 130. 

In each of the- above pipeline data processing cir- 
cuits 150 to 152, a data process is pipjelined .in three 
stages which are: a data reading stage; an operation 



executing stage wherein the read data is subjected to 
addition or subtraction 'actually performed; and a data 
writing stage wherein an operation result is stored. In 
FIG. 17, registers 140a* to 140f are for preliminarily stor- 
5 ing respective data to be supplied to . the 2-input 
adder/subtracter 130 of each of the pipeline data 
processing circuits 150 to 1 52 and subjected to an oper- 
ation performed theret^y, while registers I40g to I40i 
are for' storing operation results outputted from the 
10 i-espective pipeline data processing circuits 150 to 152. 
Below, a description will be given to the operation of 
the above conventional pipeline data processing appa- 
ratus shown in FIG. 17. 

f^irst,. a consideration will be giveri to the case 
15 where tfiree types of adding operations C, D, arid G are 
'..executed by.' the first, second, and third pipeline data 
. processing circuhs*150, 151, and 152, respectively:. 



so 



.0 = A + B 
P = E + F. 
G-H + l. 



25 Since the above three operations have no data- 
dependent relationship,therebetween, they can be exe- 
cuted completely independently. Specifically, as shown 
in the operational timing chart of FIG. 18(a), the reading 
of data A and B, the reading of data E and F. and the 

30 reading of data .H and I can be performed simultane- 
ously in the data reading stage. Moreover,.operation of 
. A + B, operation of E > F. and operation of H +. I can be 
executed simultaneously in the operation . executing 
stage. Furthermore, the writing of data C, the writing of 

35 data D, and the vvriting of data G can be performed 
simultaneously in the data writing stage. In this manner, 
the three types of data processes can be executed in 
parallel by the three pipeline data processing circuits 
150 to 152- 

40 However, the above conventional,. pipeline data 
processing apparatus has the followihg disadvantages, 
which will be described below in detail.^ In the case 
. where the execution sequence of the following three 
adding operations C, J, arid G has been predeternruned 

45 in this order, it is assumed that adding operation C is 
performed by the first pipeline data processing circuit 
150, that adding operation J is performed by the second 
pipeline data processing circuit .151, and that adding 
operation G is performed by the third pipeline data 

so processing.circuit 152: 

\ ■ 'C = A+B\ 



55 



. ^. Z . ■ ' ^ = E + C - ^ ■ 

I. ... . • G = H + L \ \ 

^. Among the foregoing three types of operations, two 
operations C and J have a data-dependent relationship 
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therebetween. Specifically, since the second operation 
J = E + C should be performed by using the result of the 
first operation C = A + B^.'^the first operation C = A + B 
and the second operation J = E +* C, cannot be per- 
formed simultaneously Consequently, as shown in the 5 
bperationa) timing chart of FIG. 18(b), the second oper- 
ation J = E + C should be initiated one cycle after the ini- 
tiation of the first operation C = A + B. 

. The execution sequence of the second operation J 
and the third operation G is determined by either an 10 
Out-of-Order method or an In-Order method. The 
former method enables the exiecution sequence of a 
plurality of data processes to be changed, while the lat- 
ter method follows the predetermined" execution 
sequence of a plurality of data processes and 'inhibits is 
the data processes from being' transposed, fh the case 
where the use of the Out-of-Order method is pi'esunied. 
therefore, the third operation G = H + I can be perforrhed 
simultaneously with the first operation C ahead of the 
second operation J, as shown in FIG. 18(b). In the case 20 
where the use of the In-Order method is presumed, on 
the other hand, the third operation G should not be per- 
formed ahead of but should be performed simultane- 
ously with the second operation J, i.e.. one cycle after 
the execution of the first operation C. since the transpo- ' 25 
sition of the operations is inhibited. 

As described above, in the conventional pipeline 
data processing apparatus, the plurality of operations C 
and J cannot simultaneously be executed' whether the 
In-Order methbd or the Out-of-Order method is used. 30 
As a result, if a data-dependent relationship exists 
between' a plurality of data processes, instructions to 
perform the respective operations C and J are not - 
issued simultaneously to the respective pipeline data 
processing circuits 150 and 151 and the instruction to 35 
perform the second operation J is issued one cyde after 
the first instruction to perform the first operation C. In 
the case where the use of the In-Order method is pre- 
' sumed. on the other hand, an instruction to perform the 
subsequent third operation G having no data-depend- <o 
ent relationship is also issued one cycle after the 
instruction to perform the first operation C. Hence, the 
conventional pipeline data processing apparatus has 
the disadvantage of considerably low processing speed 
because a plurality of data processes having a data- 45 
dependent relationship therebetween cannot be per- 
formed at high speed. 



SUMMARY OF THE INVENTION 



so 



It is therefore an object of the present invention to 
provide a pipeline data processing apparatus wherein, 
even when a data -dependent rAlatinnRhin AYiRts 
between a plurality of data processes, a plurality of 
instructions to perform the processes can be issued 55 
simultaneously to respective pipeline data processing ' 
circuits such that the plurality of data processes can 
adequately be executed, v\^ether the In-Order method 
or Out-of-Order method has been adopted. Moreover, 



even when a data process having no data-dependent 
relationship succeeds the data processes having a 
data-dependent relationship therebetween, the pipeline 
data processing apparatus enables an instruction to 
perform the succeeding data process to be .issued 
simultaneously with the instructions to perforni the pre- 

' ceding data processes having the data-dependent rela- 
tionship therebetween; thereby improving the 
performance of processing speed. 

To attain the above object, the present invention 
has adopted a structure provided with a plurality of 

' stages obtained by adding a specified number bf stages 
to the three stages which are a data reading stage, an 
operation executing stage, and a data writing stage 
such that the above data reading stage and dperation 
executing stage can shift; if necessary, to ah 'arbitrary 
stage in the plurality of stages. " 

Specif ically a pipeline data processing method of 
the present invention whereby a plurality of instructions 
to perform respective data processed'are executed by a 
plurality of pipeline data processing "circuits comprises 
the steps of: simultaneously issuing, when at least two 
data processes have a data-dependant Velaiionship 
therebetween, instructions to perform the respective 
data processes to the respective pipeline data process- 
ing circuits: then executing first the antecedent one of 
the above data processes on which the subsequent one 
of the above data processes is daia^ependent by the 
pipeline processing circuit in charge of the above ante- 

"cedent data process; and subsequently executing; at a 
point of time one cycle after the execution of the above 
antecedent data process on which the above subse- 
quent data process is data-dependent, the subsequent 
data process by the pipeline processing circuit in charge 
of the above subsequent data process. 

A pipeline data processing apparatus of the present 
invention comprises: a plurality'of pipeline processing 
circuits; and an instruction issue control portion for 
simultaneously issuing a plurality of instructions to per- 
form respective data processes to the ^bove respective 
pipeline processing circuits, wherein each of the above 
pipeline processing circuits comprises: an input data 
register for receiving and storing data from the outside; 
a data operating circuit for receiving the data stored in 
the above input data register and performing a specified 
operation with respect to the input data; an intermediate 
pipeline register disposed in a stage subsequent to the 
above data operating circuit; a final pipeline register dis- 
posed in a stage subsequent to the above intermediate 
pipeline register; and a path switching circuit for causing 

' an operation result from the above data operating circuit 
to pass through br bypass the above interm^iate pipe- 
line rpnrQtar%hH Tririftttinn cairi nnoratinn rocijH'tO the 

' ajobve fihal pipel^ the above instruction issue 

' control portion addirig the number of Dumrhy stages to 
'■be Inserted,' "in each' of ^ich nothing' is executed, to 
each of the above instructions simultaneously issued, 
thei-eby-ehabling '^n operation executing stage in which 
the above data opef&ting circuit performs the above 
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specified operation to shift to an arbitrary stage in a plu- 
rality^ of pipeline stages. 

. With the above structure, a plurality of (e.g., three) 
pipeline data processing circuits are arranged in parallel 
according to the preserit invention. In the case where a 
data:dependent relationship exists between the plurality 
of (three) data processes (e.g., when operation C = A + 
B, operation J = E.+ C, and operation K = H + J), a plu- 
rality of instructions to perform the plurality of data proc- 
,esses are issued simultaneously to "the ab^ove 
respective pipeline data processing circuits. In the pipe- 
line data processing circuit for performing the first data 
process .C, a data reading stage and an operation exe- 
cuting stage are assigned to the first and second . 
stages, respectively. In the pipeline data processing dr- 
. cuit for performing the second data process J, the data 
reading, stage and operation executing stage are 
assigned to the second, and.third stages.* In the pipeline 
data processing circuit for perforrriing the third data 
process K. the data reading.stage and.operatipn execut- 
ing stage are assigned to the third and fourth stages. 
Consequently, the plurality of data processes C. J, and 
K are sequentially executed so that, even when a data- 
dependent relationship exists between these data proc- 
esses, the data processes are adequately, executed. 

Here, the respective instructions to' perform the 
.above plurality of data processes C, J. and .K are issued 
simultaneously and the subsequent instruction (i.e.. the 
instruction to perform the fourth data process) is issued 
in the second cycle. In the conventional embodiment, 
the fourth instruction is issued in the fourth cycle. Con- 
sequently, according to the present invention, a large 
number of consecutive instructions can be. issued with- 
out stagnation so that the plurality of data processes 
having a data -dependent relationship therebetween can 
• be executed at high speed. ^ . 

The above objects and novel features of the present 
invention will be more apparent frorn the reading of the 
following description in conjunction with the acconnpa- 
nying drawings. * : 

BRIERDESCRIPTION OF THE DRAWINGS 

. The accompanying drawings illustrate the prefen;ed 
ernbodiments of the present invention. 
In the drawings; 

FIG. 1 shows the overall structure of a processor; 
FIG. 2 illustrates the operation of pipielining opera- 
tion; . . 

FIG. 3 shows th^. overall structure, of a pipeline data 
• processing apparatus according to a.first^ernbpdi- 
ment of the present invention; ^ . .^ .u .. , 

FIG. 4 shows the internal structure of a pipeline, 
data processing circuit constituting the pipeline 
dataprocessing apparatus;^ 
FflG. 5 shows a first variation .of the pipeline data 
processing circuit;- ^ . . ■ . 



FIG. 6 shows a second Va/iation of the pipeline data 
processing circuit; , ' 
FIG. 7 shows a third variation of the pipeline data 
processing circuit; 
5 ' . FIG.. 8 is a timing chart showing a basic operation of 
the pipeline data processing circuit; 
FIG,. 9 is a timing chart showing another basic oper- 
ation of the pipeline data priocessihg circuit; 
* FIG., Id is a timing chart showing still another baisic 
10 operation of the pipeline data processing circuit; 

FIG. il is an operational timirig chart in the. case 
' where three data, processes 'are performed by a 
pipeiihe data processihg apparatus, according to 
the preseint invention; 
75 / FIG. 12 is an operational tirhing chart in the case 
/ where a large number of data processes are per- 
' formecj , by a conventional pipeline data processing 
apjDaratus; ... 
, FIG. 13 is an operational timing chart in the case 
so _ / where a plurality of data processes are perfornried 
by the conventional pipeiihe data processing appa- 
.'ratus;, " . 

FIG. 14 shows the ovei'all structure of a pipeline 

data processing apparatus according to a second 
25 . . embodiment ^of the present invention; ' 

FIG. 1 5 shows the structure of a principal portion of 

a processoV including the pipeline data processing 

apparatus according to the second embodiment of 

the present invention; 
30 FIG. 16 is an operational timing chart in the case 
. where six data processes are performed by the 

pipeline data processing. appariatus.' according to 

the second emtxxiiment of the present invention; 

FIG. 17 shows the overall structure of a conven- 
35 . tional pipeline data processing apparaitus; 

FIG. 18(a) is ah operational timing chart showing a 

basic operation of the conventional pipeiihe data 

processing apparatus; r 

F\G. 18(b) is an operational timing chart showing 
4o" another basic exanple of the conventional pipeline 

data processing apparatus; and ' 

F\G. 18(c) is an operational timing chart showing 
. . still , another basic operation of the conventional 

pipeline data processing apparatus. 

45 . . ^ ...... 

. DETAILED DESCRIPTION OF THE INVENTION 

Below, the preferred embodiments of the present 
' .invention will be. described with reference to the draw- 
so ings. . . 

. .(First Embodiment of Pipeline Data Processing Appara- 
tus) ' 

55. fIg.. 1 shows a first embodiment of the. present 
invention, in which are shown: a processor 1 ; an instruc- 
.tion memory 2a for preliminarily storing ari instruction 
- program: and a data memory 2b for preliminarily storing 
a large number of data sets. It is not necessarily 
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.required to provide the atiove instruction memory 2a the third stage, data is read from the register file 9. In 

and data memory 2b as separate components. A large the fourth stage, an operation is executed with respect 

number of instructions and data sets may be stored to data. In the five stage, the operation result is written 

together In a single memory; • *= ' jn the register file 9. The above second and third stages 

The above processor 1 cprisists of an instruction s may be implemented by a single stage, 
control portion 3 and an instruction execution portion 4 FIG. 3 shows a specific structure of the above exe- 
for executing such ari ' instruction to perform the ' aition processing portion 1 0 as a pipeline data process- 
processing of data. The above instruction control por- ing apparatus of the present invention, in the drawing, 
tion 3 consists* of. an^ instruchiori fetch portion 5, an reference numerals 100. 101, and 102 designate pipe- 
iristruction register 6, and an instruction decoding por- io line data processing circuits an-anged in pai'allel to each 
tion 7, and'an instruction issue control portion 8. The other. Since the pipeline data processing circuits 100 to 
above instruction fetch portion 5 generates one or a plu- 102 have the same structure, a description will be given 
rality of instruction addresses ahd trahsmits'them to the to the interndl structure o1 the pipeline data processing 
above instrudion rhemory 2a, while fetching brie or a circuit' 100 as a representative of the three circuits. In 
plurality of instructions necessary. The ib6ve| .fetched 15 the drawinig are shown: first and second data input ports 
instructions are stored by the above instruction fetch 18 and 19'for i-eceiving data from th6 above register file 
portion 5 in the instruction register 6. The'abo^ietnstruc- 9; first and second data input' registers 20 and 21 for 
tion decoding portion 7 decodes every specified number storing respective sets of data received from the above 
(e.g., three).of instructions stored in the instruction reg- respective data input ports 18 and 19; and a 2-input cal- 
ister 6. The" above instruction issue cbntrol ^portion 8 20 culator (data calculating circuit) for receiving the respec- 
issues an instruction to the resource corresponding to five sets of data stored in the above two data input 
the instruction based on the result of decoding. For registers 20 and 21 and' executing addition or 'subtrac- 
exanrple, if the instruction directs the processing of tion with respect to the respective sets of data:' 
data, the instruction issue control portion 8 issues rt to In the drawing are also shown: a first (intermediate) 
the above instruction execution portion 4. If the ir^struc- 25 pipeline register 40 disposed in a stage subsequent to 
tion is a branch instruction to control the stream of the the above calculator 30; a second (ihtermediate) pipe- 
instruction program, the instruction issue control portion line register 60 disposed in a stage' subsequent to the 
8 issues it to the instruction fetch portion 15. Moreover, if above first pipeline register 40; and a selector (path 
the operation of the above instruction issue control por- ^ switching circuit) 70 disposed in a stage subsequent to 
tion 8 is to be described roughly, it examines/ when the 30 the second pipeline register 60. The above first pipeline 
instruction is to be issued to the instruction execution register 4b receives a data operation i^esult 31 outputted 
portion 4, the state of the resource required for execut- from the above calculator 30 and stores rt. The above 
ing the instruction in the instruction execution portion 4 second pipeline register 60 receives the output data 
and the presence or absence of a'data-dependent rela- from the above first pipeline register 40 and stores' it. 
tionship betWeehfa plurality of instruction data sets and 35 The above selector 70 receives the data operation 
issues only an executable instruction to' the instruction result 31 outputted from the above calculator 30 and the 
execution portion 4. As for the irrternal structure and respective sets of data stored in the above first knd sec- 
"operation of the 'above instruction issue control portion ond pipeline-registers 40 and 60. selects any one of the 
8, they will be described later in detail. • ' • three, and outputs the selected one. In the drawing, a 

The above instruction execution portion 4 consists 40 reference numeral 81 designates a' final pipeline regis- 

ofa register file 9 for temporarily storing data, ah execu- ter lor receiving the output data 80 from the above 

tion processing portion VO for performing an operation selector 70 and stores it. . ^ - . 

with respect to data and an execution control portion 11. In each of the above pipeline data processing cir- 

The above execution control portior) iV^loads data cults 100 to 102, the above registers 20, 21, 40, and 60 

required by the exedrtion processing portion' 10 to per- 45 and selector 70 provided therein are controlled by the 

form an operation with respect to data from the above above execution control portion 1 1 of FIG.' 1 (FIG. 4 

data memory 2b,*while storing the data in the register shows only the pipeline data processing circuit 100). 

file 9. In actual execution of the operation in the execu- Reference numerals 9a to 9i designate registers in the 

tion processing portion 10. the above execution control above register file 9. • 

■ portion 1 1 reads data stored in the above register file 9 so - ' The above pipeline data processing circuit 100 may 
and controls the execution processing portion 1 0 so that also be composed of any of the pipeline data process- 
the operation in accordance with the instruction is exe- " ing circuits shown in FIGS;' 5 to 7. The pipeline data 
CLrtftH thftrpihy anH Rto^es the Operation res'jh In l+re reg- preceding 'tircLiit 120 -chc'-vn in F!G. 5 ic cbtcir.cd by 
ister file 9. " increasirig the number of pipeline stages of the pipeline 
In the above processor 1 , the operation with respect 55' ' data processing circuit 100 of* FIG. 4 by one. Specifi- 
to data is performed by pipelining processing. As shown" - ' cally, in FIG, '4;'ari additional pipeline register 90 and an 
in FIG. 2, the pipeline* preceding consists' 6f "five additional selector (path switching circuit) 9'1 are pro- 
stages. In the f irst:stage, an instruction is fetched. In the . vided between the selector 70 and the final pipeline reg- 
second stage, the instruction is decoded and issued. In ister 81 . The above additional pipeline register 90 
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receives the result of selection from the selector 70 and 
stores it. The above additional selector 91 receives the 
operation result 31 from the calculator 30 and the 
respective sets of data stored in the intermediate (first 
and second) pipeline register 40 and 60 and in the addi- 
tional pipeline register 90, selects any one of the three, 
and outputs the selected one. The. output 92 from the 
second selector 90 is inputted to the final pipeline regis- 
ter 81 and stored'therein. The pipeline data' processing 
circuit 130 of FIG. 6 is obtained by providing a pipeline 
register 30a in the calculator 30* for execution of addi- 
tion or subtraction in a 2-stage pipeline. In the pipeline 
data processing circuit 140 of FIG. 7, a fiVst selector 52 
is interposed between the first and second pipeline' reg- 
'isters,40 arid 60 and a'secdiid selector 72 is interposed 
between the second and final pipeline registers 60 and 
81.' The above first selector 52 receives the operation 
result fronri the calculator 30 and the data stored in the 
previous-stage (first) pipeline register 40, selects either 
one of the two, ahd.outputs the selected one. The above 
second selector 72 receives the result of selection from 
the above first selector 52 and Xhe diata , stored in the 
. previous-stage (second) pipeline register 60. selects 
-either one of the two, and outputs it to the final pipeline 
register 81 . The calculator 30;in each of the above pipe- 
line data processing circuits 120 and 1 40 of FIGS. 5 and 
7 may also be composed of the calculator 30* in the 
pipeline data processing circuit, 130 of FIG. 6. To com- 
pose the three pipelirie data processing circuits 100. to 
1 shown in FIG. 1 , any one or some of the processing 
circuits 120 to 140 described above may be used singly 
or in combination. However, in the case where some of 
the pi-ocessing circuits 120 to 140 are used in combina- 
tion to compose the pipeline data processing circuits 
100 , to 102, their pipelines should have the' sarne 
riumber'of stages. . 

UexX, a description will be given to the execution of 
one operation by the single pipeline data processing cir- 
cuit (100, for example) before a description will be given 
to the operation of the pipeline data processing appara- 
tus shown in FIG. 3 wherein a plurality of operations are 
performed in parallel. Since the pipeline data process- 
ing circuits shown in FIG. 4 to FIG. 7 exhtoit the same 
^ basic operation, a description will be given below to the . 
pipeline data processing circuit 100 of FIG. 4 as a rep- 
resentative. 

The pipeline data processirtg circuit^ 100 enables 
the execution of data processing, with three timings 
shown in FIGS. 8, 9, and 10. In FIGS. JB to.10, respective 
cycles , for illustrating the timings .for, the , pipeline 
processing are designated as first to fifth^cycles,. while 
respective stages in hardware:for jllustrjating fa,,f low. of 
data in the pipeline data processing circuit -100 ar^. des- 
ignated as a data reading stage, an ope'ation executing , 
stage, a Trough (1) stage (first pass stage), a Trough.(2) 
stage (second pass stage), and a data writing stage for 
the convenience of illustration. A data process is basi- 
cally executed in five pipeline stages. In RGS. 8'to 10. 
the timing for the pipeline processing is associated with 



an actual flow of data iri the'pipeline data processing cir- 
cuit. ' , 

FIG. 8 shows the most basic operation of the pipe- 
line data processing circuit 100. .The data processing 
5 (e.g.. operation C = A + B) is executed as follows: The 
selector 70 selects the output from the second pipeline 
register 60. In the first cycle, data A and B is read and 
stored in the respective input registers 20 and 21 in the 
data reading stage. In the second cycle, the operation of 
10 . A.-»- B is executed in.the.operation executing stage and 
the operation result C is stored in the first pipeline regis- 
ter 40. In the. third cycle, data" (operation result ..C) 
passes through the Trough (1) stage and. is. stored as it 
is in the second pipeline register 60. In the fourth cycle. 
75 ^.'d'ata '(operation result C) passes through the Trough (2) 
stage and is' stpred as it is in the third pipeline register 
' 80. Thus, in the two Trough stages, no processing is 
performed wjth respect to the data (operation result C) 
and the d^ta simply shifts to. the subsequent stages in 
20 . sequence.. In the fifth cycle, the operation result C is 
. wrrtten [n the data writing stage. In FIG. 8, the data flows 
' in the pipeline data processing circuit .100 wrthout stag- 
nation. . . . - „ 
In FIG. 9, the first cycle of the, f ive.-stage pipeline is 
25 used as a Dummy cycle. The selector 70 selects the 
output from the first pipeline register 40. In the first cycle 
(Dummy cycle), nothing js executed so that the pipeline 
_ is on hold. As a result, even vy(hen the timing proceeds 
' to the subsequent .second cycle, the data does not actu- 
30 ally .shift from the data reading stage to the subsequent 
operation executing stage. It is not until in the second 
cycle that.the data A and B is read in the data reading 
stage and stored in the respective registers 20 and 21. 
. In the third cycle, operation of A + B is executed in the 
35 operation executing stage and the operation result C is 
stored in the first pipeline register 40, In the fourth cycle, 
the operation result C passes tiirough the Trough (1) 
stage and is stored jn the third pipeline register 80 via 
_ the selector 70. In the fifth cycle, the operation result C 
40 is written in the data writing stage. Bri^ly, the data from 
- the Trough (1) stage (i.e.. the operation result C stored 
in the first pipeline register 40) bypasses the Trough (2) 
stage and shifts to the data verting stage. 

Here, it should be noted that, when the first cycle is 
45 a Dummy, cycle, the d^ata J and K for the subsequent 
data process (e.g., L = J -i- K) cannot be inputted in the 
second cycle, in this case, since^the data reading stage 
is reached only in the second cycle, the subsequent 
data J and K can be inputted only when the data A and 
50 B currently executed shifts to the operation executing 
.stage, i.e.. the third cycle. 

-.FIG 10, illustrates the operation wfhen Dummy 
...cycles are .assigned to the first and second cycles. The 
. selector 70 selects the output from the calculator 30. In 
55 the first and ,second cycles (Dummy cycles), nothing is 
-executed -so that the pipeline is on hold. As a result. 
. . even .vvhen the timing proceeds. to. the subsequent third 
cycle, the data A and B does not actually shift from the 
data reading stage to the operation executing stage. It is 
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not until Jn the third cyde that the data A and B is read 
and stored in "the respective registers 20 and 21 in the 
data reading stage. In the fourth cycle, operation of A + 
B is executed in Ihe operation executing stage and the 
operation result C is stored in' the^ third pipeline register 
'80 via the selector 70. In the fifth cycle, the above oper- 
ation result C written in the data writing stage. Briefly, 
the data Irom the operation executing stage (operation 
result C) bypasses' the Trough (1) and (2) stages and 

• shifts to the data^writing stage. * 

Here, it should be noted that, when the firsthand 
second cycles are Dummy cycles, the data for the sub- 
sequent data process cannot be inputted until the third 
cycle is initiated. In this case, since the data re^d cycle 
is reached for the first time in the third cycle', the Subse- 
quent data can be inputted only when the dala A ahd B 
currently executed shifts to fie operation' executing 
stage, i.e., the fourth cycle." 

Next, a description will be given to the internal 
structure of the above instruction issue cdntrol portion 8 
of FIG. 1- As shown th the di-awing, the instruction issue 
conti'ol portion 8 comprises; a datia-dependenicy detect- 
irSg portion 8a; a resource hazard control portion 8b; 
and an issue control portion 8c. The above data- 
dependency detecting portion 8a receives information 
obtained through the decoding of a plurality of (three) 
instructions from the above instruction decoding portion 
7 and judges, based on the information, the presence or 
absence of a'data-depi^ndent relationship between the 

' data processes (e.g., C = A + B, E = C + D, and G = E + 
Fj' directed by the three instructions. The above 
resburce hazard control portion 8b receives a signal 
rejDresehting that any' of the above ' pipeline data 
processing circuits 100 to 102 is executing the Dummy 
cycleshownin FIGS. 9and 10 (Dummy-cycle-execution 
signal), judges that thiere is a resource hazard on 
receipt of the signal, and outputs the resource hazard 
signal to the issue control portion 8c. The above issue 
control portion 8c judges whether or not each of Ihe 
pipeline data processing circuits TOO to 102 is in the 
executable state in which the instruction is executable 
and further judges, H it has judged that the instruction is 
executable, whether or not the above Dunrimy cyde 
shown in FIGS. 9 and 10 should be inserted. If the 
Dummy cycle should be inserted, the issue control por- 
tion 8c determines the number of Dummy cycles to be 
inserted and adds the number to the instruction and out- 
puts one or a plurality of instructions to the above exe- 
cution control portion T1 such that the iristruction or 
instructions are executed iri the pipeline data process- 
ing drcuit or" circuits in the "executable state. 

Specifically, the above issue control portion 8c 

jkjdgcS vvMCiMcr or riCt each of the pipcnnc data piOwco2~ 

ing circuits 100 to 102 is in the executable' state by 
examining whether or not each of the pipeline dita 
processing circuits '100 to 102 is executing a Dufhmy 
cycle; if each of the pipeline data processing circuits 
100 to 102 is not executing a Dummy cycle, the ii&sue 



control portion 8c judges that it is in the executable 
state. 

The insertion or no insertion of a Dummy cyde arid 
the number of Dummy cycles to be inserted are deter- 

5 mined in the following manner. First, it is judged whether 
or nota data-dependent relationship exists between the 
data process previously executed and the data process 
to be subsequently executed or between a* plurality of 
data processes to be simultaneously executed. If there 

10 is no data-dependent relationship, it is' determined that 
the number of Dummy cycles is 0, which indicates that 
no Dummy cycle should be iriserted. if there is any data- 
deperident relatioriship, on the other hand, the number 
of Dummy cycles. to be inserted is determined as fol- 

75 lows: For ease of understanding, a consideration wiH be 
, given to the case where thrjee data processes C = A + 
B, D = E + C, and G = F + D are executed in orderly 
sequence,, as shown in FIG. 11. The riumber of Dummy 
cycles to be inserted is determined first for the data 

20 process. to be executed with the highest priority and 
then determined, sequentially for 'the other processes 
with lower priorities. If tiie number of pipeline stages of 
. the calculator 30 in the pipeline data processing circuit 
(which is "1" in the pipeline data processing drcuit of 

25 FIG. 4 and is "2*' in the plipeline data processing circuit 
of FIG. 6) for executing the data process D (or G) which 
has a data-dependent relatiohship with the antecedent 
data process C (or D) is designated at a (by way of 
example, a jdescriptipri will be given hereinafter to the 

30 pipeliriedata processing circuit of FIG. 4in which a = 1), 
ttie nuntier of Dummy cycles to be inserted iri the ante- 
cedent data process C (or D) which is executed by the 
' pipeline data processing circuit arid on which the subse- 
quent data process D (or G) is data-dependeiit is d^sig- 

35 hated at b, and a difference in the initiating cycles of the 
above data processes having a data-dependent rela- 
tionship (C and D or D and G) is designated at c, the 
number x of Dummy cycles can be calculated by the fol- 
lowing equation: 

40 • ■ '" ' - • ' * 

" X = a + b - c. 

In the above example shown in FIG. 11, since the 
• first data process C = A + B has no antecedent data 
45 process on which it is data-dependent, the number of 
Dummy cydes is "0". As for the second dkta process D 
= E -»- C which has a data-dependent relationship with 
the first data process C, the number b of Dummy cycles 
to be inserted in the data process C is "0" and the differ- 
so 'ende c in the initiating'cycles of the two data processes 
• ^^is ^O" (srhnultaneous execution), so that the number x of 
- Dummy cycles tb be calculated becomes 1+0-0 = 1. 
? va\^ f^^^^^'^f^]^^^^^' pr333^^G = F ; D v/hich hss 2 
' d^ta<lepend'ent relationsh with" the antecedent- data 
55 --'pro'te^ Dl^%V number^- b *; of Dummy cydes to be 
inserted in the bata process D on which the^ data proc- 
ess G^is. dependent is ^l • and a difference c'in the initi- 
"ating cycles of the' two processes is "O" (simultaneous 
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execution) /so that the number x of Dummy cycles to be 
calculated becomes 1 + 1-0 = 2. 

As will be appreciated from FIG. 1 1 . of the three 
data processes C. D. and G, the first data process C is 
executed with the timing of FIG. 8, the second data 
process D which has a data-dependent relationship with 
the antecedent process C is executed with the timing of 
FIG.S. and the third data process G which has a data- 
.dependent relationship with the antecedent process D 
is executed with the timing of FIG.' 10.* Consequently 
even when respective instructions to perform the three 
data processes C. D, and (2 having a data-dependent 
relationship therebetween are. issued simultaneously, 
they^ can be executed without stagnation. The data proc- 
esses C, p, and G are. completed at the same time, so 
that the In-Order method is surely irnplemented, 

In FIG.. 1.. the execution control portion 1J receives 
an instruction Issued from the issue control portion 8c 
and causes that one olthe pipeline data processing cir- 
cuits. 100 to 102 which is designated by the instruction 
to execute the data process directed by the instruction, 
The above execution cpntrol portion 11 controls the 
selector 70. of Jhe above designated, pipeline data 
processing circuit so that the output from the second 
pipeline register. 60, is selected when the number x of 
Dummy cycles that has been added to the correspond- 
ing instruction is 0. that the output from the first pipeline 
register 40 is selected when the number x of Dummy 
. cycles is 1 , and that the output from the calculator 30 
(operation result) is selected when the number x of 
Dummy cycles is 2. If. any of the pipeline data process- 
. ing circuits 100 to. 102 is executing a Dummy cyde.-the 
. atove execution control portion 11 transnriits a Dummy- 
cycle-execution signal to the above : resource hazard 
control portion 8b. . 

Next, referring to FIG. 12. the operation of the pipe- 
line data processing apparatus of the present embodi- 
ment will be described by using a rnore complicated 
data processing sequence: 



C = A + B 


(1) 


E = C + D 


(2) 


G=E + F 


(3) 


J = H-hI 


(4) 


L=J+.K^ - 


(5) 


N = L + M 


(6) 


P.=.N.+ Q l i^. 




S=Q+R . 


. '. : (8) 


.u=S.+T.-. ,. 


• (9) 


x = u + v 


(10). 



Of these ten data processes, each of the data proc- 
esses except the data processes (1). (4), and (8) has a 
data-dependent relationsHip'with the aritecedent data 
process. 

5 In. FIG. 12, three instructions to perform the data 
processes C. E, and G are decoded first and issued 
simultaneously The first data process C, the second 
data process E, and the third data process G proceed in 
the pipeline data processing circuits 100, 101. and 102, 

10 respectively Among these, a data-dependent relation- 
ship exists between the first and second data processes 
C and E. while a data-dependent relationship also 
exists .between the second and third data. processes E 
and G. The number x of Dummy cycles to be inserted in 

75 the pipeline , data processing circuit 101 becomes 1, 
since a = 1, b = 0. and c = 0 in the above equation .for 
jcalculation. The number x of. Dummy cycles to, be 
inserted in the pipeline data processing circuit 102 
becomes 2. since a = 1 , b = 1 , and c = 0 in the above 

20 equation fpr.calculation. 

, in the fjrst cycle, three instructions to perform the 
subsequent three data processes. J,. L. and N are 
decoded. Since, the two pipeline data processing cir- 
cuits 101 and, 102 are executing Dumrny cycles, 

25. resource hazard signals are ^ generated , for these 
processing circuits 101 and 102. In response to these, 
only an instruction to perform the fourth data process J 
is issued so that the data process J proceeds Jn the 
pipeline data processing circuit 100.. Since the fourth 

30 data process J has no data-dependent relationship with 
any of the existing data processes C, E, and G, the 
number. X of Dummy cycles js 0 in the pipeline data 
^ processing circuit 1 00 for performing the data process J. 
In. the second cycle, the other two instructions that 

35 have not been issued in the preceding cycle and an 
.instruction to perform the: seventh data process P^are 
. decoded.- Since only the pipeline data processing circuit 
102 is executing.a Dummy cycle, a resource hazard sig- 
nal is generated , for the processing circuit 102. In 

40 response to this, two instructions to perform the fifth and 
sixth data processes ,L and N are issued and the data 
. processes L and N proceed in the pipeline data 
. processing circuits 100. and 101, respectively. Here, a 
- data-dependent relationship exists between the fourth 

45 and fifth data processes L and N, while a data-depend- 
ent relationship also exists between the fifth and sixth 
^ data processes L ard N. Consequently, the number x of 
..Dummy cycles- to be inserted in . the . pipeline data 
processing. circuit 1 00- becomes. 0, since a = 1, b = 0, 

50 . and c = 1 in the above equation, for calculation. The 
. number x of Dummy cycles to be inserted in the pipeline 
data processing circuit 101 becomes 1 , since a = 1 . b = 
0. and c = 0 in the above equation for calculation. 
In the third cycle, the other instruction that has not 

55 been issued in the preceding cycle and instructions to 
perform the eighth and ninth data processes S and U 
are decoded. Since only the pipeline data processing 
circuit 101 is executing a Dummy cycle, a resource haz- 
ard signal is generated for the processing circuit 101 . In 



response to this, two instructions to pertorm the seventh 
and eighth data processes P and S are issljed so that 
the data processes P and' S probeed in the pipeline data 
processing circuits 100 and 102, respectively. Here, a 
data-dependent relationship exists between the seventh 
and eighth data processes N and P. Consequently, the 
number x of Dummy cycles to be inserted in the pipeline 
data processing circuit ICQ becomes 1, since a = 1, b = 
1, and'C = i: On the other hand, the nurhber 'x of 
Dummy cycl^ to be inserted in the pipeline data 
processing circuit 102 is 0. since no data-dependent 
-relationship exists- between the seventh and eighth data 
processes P and S: 

In the fourth cycle, \We other instruction that has not 
been issued in the preceding cycle and an instruction to 
perlorm the tenth data process X are decod'ed; Since 
only' the pipeline data processing cirduit 100 is execut- 
ing a bummy cyde. a resource hazard signahs*gener- 
ated for the processing circuit 100. In' response to this, 
tvvo instructions to perform th^ ninth and iehth data 
processes U and X are issued so that^the data proc- 
esses U and X proceed iri the'pipeline data processing 
circuits 101 and 102; respectively. Here, a data^Jepend- 
ent relationship exits between the eighth and ninth data 
processes S and U, while a data-dependent relationship 
also exists between the ninth and tenth data processes 
U and X; Consequently, the number x of Dummy cycles 
to be inserted in the pipeline data processing circuit 101 
becomes 0, since a = 1. b = t, and c = 1 in the above 
equation for calculation. On the other hand, the number 
x of Dummy cycles to be inserted in the pipeline data 
processing circuit 102 becomes 1. since a = 1, b^=0, 
and c = 0 in the above equation for calculation. 

In the present embodiment, the point of time at 
which alt the instructions to perform the ten data proc- 
esses have bfeeri'issued falls in the fifth cycle; while the 
point of time ^t which the' execution of all the data proc- 
esses has been completed falls in the ninth cycle, as will 
be appreciated from FIG. 12. On the other hand, if the 
above data processing sequence is executed using the 
conventional pipeline data processing apparatus of FIG. 
17, the point of time at which all the ten data proceisses 
have been issued falls in the eighth cycle, while the 
point of time at which the execution of all the data proc- 
esses have been completed falls in the 'tenth cycle, as 
shown in the timing chart of FIG:* 13. ' ■ 

From a comparison of the embodiment of the 
present invention shown in FIG. t2 to the conventional 
embodiment showri in FIG. 1 3, it will be-appreciated that 
the pipeline data processing- apparatus in the embodi- 
ment of the present invention has an advantage over the 
converitional embodiment in terms of the performance 



(Second Embodiment of Pipeline Data Processing 
Apparatus) 

FIG. 14 shows a pipeline data processing appara- 
5 tus showing a second embodiment of the present inven- 
tion. 

In the present ernbodiment, three additional pipe- 
line data processing circuits 1 03 to 1 05 having the same 
structure are provided in parallel in the structure of FIG. 

10 1, Each of the pipeline data processing circuits 103 to 
105 has a multi-input calculator 30" having thre^ or 
more inputs (three inputs in the drawing). In compliance 
with the 3-input calculator 30", there have been pro- 
vided three data input ports' 17 to 19 and* three data 

T5 input registers 20 to 22 each for storing data inputted to 
the corresponding data input port. "The data stored in 
the data input registers 20 to 22 is inputted to' the above 
3-input calculator 30"' and subjecfed to addition or sub- 
traction. As for the other components, they are the same 

20 as those of the pipeline data' processing circuit of the 
first embodiment shown in FIG. 4, so that the ciescrip- 
tion thereof will be omitted by providing the same com- 
ponents with the same reference numerals. In FIG. 14, 
■ reference numerals 9j tp 9r designate r-egisters in'the 

25 riegister file 9. 

FIG.- 15 shows the structure of a principal portion of 
a processor including the above pipeline data process- 
- ing apparatus of FIG. 14. Since the overall structure of 
the processor is similar to the structure of the processor 

30 of the above first embodihient shown in FIG. i] only dif- 
ferent components will be described l^low. The proces- 
sor of FIG. 15 is different from the'processor of FIG. 1 
' only in that'the instruction issue control portion 8' with 
the issue control portion 8c' is provided instead of*the 

35 instruction issue control portion-S with the issue control 
portion Sc. The issue control portion 8c' alters a plurality 
of data processes to be executed" so as to eliminate a 
data-dependent relationship between the data*^ proc- 
esses. For example, in a plurality of (two) data proc- 

40 esses having the following data-dependent relationship 
therebetween: 

C = A+B 

45 D = E -t- C, 

the tatter data process is replaced by 

D = E-kC 
so / =E.k(A + B) 

= E-hA-hB. 

is issued so that the data process D is executed by the 
55 above pipeline data processing circuit 1 03 having the 3- 
input calculator 30". Consequently, a plurality of data 
processes originally having a data-dependent relation- 
ship can be executed simultaneously in parallel as a 
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plurality of data processes having no data-dependent 
relationship. 

Accordingly, in the case where the following six 
data processes having a data-dependent relationship 
are executed in the pipeline data processing apparatus 
of , the present embodiment: 

\C = A-^B . - . ■ 

6= F + D " " 

H= U'G : • 

L=M + J, 

instructions to perform the six data processes, having a 
datardependent relationship can be issued and exe- 
cuted simultaneously under the two effects of elimina- 
.tion of the data-dependent,re!ationship by the pipeline 
data processing apparatus having the 3-input calculator 
30" and insertion of the Dummy cycle described in the 
first embodiment. , . . 

Although the present embodiment has substituted 
the 3-input calculator 30" for the calculator 30 of the 
pipeline data processing, circuit shown in. FIG. 4, it will 
be appreciated that the calculator 30 of each of the 
pipeline data processing circuits shown in FIGS. 5, 6, 
and 7 may be replaced by the 3-input calculator 30". 

Claims 

1 . . A-pipeline data processing apparatus comprising: 

a plurality of pipeline processing circuits; and 
. an instruction issue control portion tor simul- 
taneously issuing. a plurality ot instructions to per- 
form respective data processes to said respective 
pjpeliine processing circuits, wherein* 

each of said pipeline processing circuits 
• comprises: . . 

an input data register for receiving and stor- 
ing data from the outside and; . 

a data operating circuit for receiving the data 
stored in said input data register and performing a 
specified operation with respect, to the input data; 

, an intermediate pipeline register disposed in 
a stage subsequent to said data operating circuit; 

a final pipeline register disposed, ia.a stage 
subsequent ^to said: internjiediate pipeline register; 
and ......... V. — — 

a path switching circuit for causing an opera- 
tion result from said data operating, circuit to pass 
through or bypass said interniediate pipeline regis- 
ter and inputting said operation result to said final 
•pipeline register,^ . - 

said instruction issue control portion adding 



the number of Dummy ^stages to be inserted, in 
each of which nothing i^.executed, to each of said 
instructions simultaneously issued. 

thereby . .enabling an operation executing 
5 stage in which said data operating circuit performs 
. said specified operation. tp shift to an arbitrary stage 
in a plurality of pipeline stages. 

2. A pipeline data processing apparatus according to 
10 claim 1, wherein 

the plurality of instructions simultaneously 
issued from the instruction issue control portion 
. include respective instructions to perform at least 
two data processes having a data-dependent rela- 
T5 . ; tionship therebetween. 

3. A pipeline.data processing apparatus comprising at 
. i . least one pipeline processing circuit, wherein 

. .. . , said pipeline.processing.circuit comprises: 
20 an input data register for receiving and stor- 

^ . ing data from the outside; . , ^ . . 

a data operating circuit forVeceiving the data 
stored in said input data register and performing a 
- . specified operation with respect to the input data; 
25 an intermediate pipeline register disposed in 

a stage subsequent to said data operating circuit; 
a final: pipeline register disposed in a stage 
. subsequent to said intermediate pipeline register; 
and 

30 ... a path switching circuit for causing an opera- 

tion result from said data operating circuit to pass 
through or bypass said intermediate pipeline regis- 
ter and inputting said operation result to said final 
pipeline register, 

35 thereby enabling an operation executing 

stage in which said data operating circuit performs 
said specified operation to. shift to an arbitrary stage 
in a plurality of pipeline stages. 

40 4. A pipeline data processing apparatus according to 
claim 1 or 3, wherein 

said plurality of pipeline processing , circuits 
are arranged in parallel to each other. 

45 . 5. A pipeline data processing apparatus according to 
claim 1 or 3. wherein 

the intermediate pipeline register is com- 
. posed of first and second pipeline registers, 

said path switching circuit is composed of a 
so . . , selector interposed between said second and final 
pipeline registers, and 

. ' - . said selector receives the operation result 
-from said data operating circuit and respective 
• operation results stored in said first and second 
55., pipeline registers, selects any one of the operation 
^ results, and outputs the selected one to said final 
pipeline register. . 
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6. A pipeline data processing apparatus according to 
claim 1 or 3, wherein - 

'* the intermediate pipieline register is com- 
posed of first and seddnd pipeline registers, 

said path switching circuit is composed of a 5 
first selector interposed between said first and sec- 
ond pipeline registers and a second selector inter- 
posed between said second and final pipeline 
registers, 

said first selector receives the' operation 70 
- result from said data operating circuit and an oper- 
■ ation result 'stored in said first pipeline register, 
selects either one of the operation results, and out- 
puts the' selected one to said second pipeline regis- 
ter,* and • . . . , - 

said second selector receives the operation 
result selected by said first selector and an opera- 
tion result stored in said second pipeline register, 
selects either one of the operation results/ and out- 
puts the selected one to said final pipeline register 20 

7. A pipeline data processing apparatus according to 
claim 5 or 6, wherein • " • 

a stage in which said data from the outside is 
stored iri said input data register is a data reading 2S 
stage. 

the stage in which said data operating circuit 
performs said specified operation and stores the 
operaton result in said first pipeline register is the 
operation executing stage. so 

a stage in which tiie operation result stored 
in said first pipeline register is stored in said second 
pipeline register is a first pass stage, and 

a stage in which the operation result stored 
in said second pipeline register is stored in said 35 
f inal pipeline register is a second pass stage. 

8. ' A pipeline data processing apparatus according to 

claim 1 or 3. wherein 

said data operating circuit has an internal 40 
pipeline register. . • 

. 9. -A pipeline data processing apparatus according to 
claim 1 of 3, wherein 

said data operating circuit is conrtposed of an 45 
adder or a subtracter each having tWo or more data 
input ports. * • 

10. A pipeline data processing apparatus according to 
claim 1 or 4, wherein . . : 

each of said plurality of pipeline processing 
circuits has an equal number of pipeline stages, 

finmo nf RaiH njijralrty nf pipeline prOCeSSiPQ 

circuits are composed of* pipeline- processing cir- 
cuits in each of which said data operating circuit 55 
has an internal pipeline register and the other pipe- 
line data processing circuits are coriposed'of pipe- 
line data processing circuits in each of which said 
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data operating circuit has the internal pipeline reg- 
ister. 

11. A pipeline data processing apparatus according to 
claim 5 or 6. wherein 

in the pipeline data processing circuit that 
has received the instruction, when the number of 
Dummy cycles to be inserted that has been added 
to the received instruction is "0", the data input reg- 
ister receives and stores the data from the outside 
in a first cycle, while the selector selects the opera- 
tion result stored in the second pipeline register. 

12. A pipeline data processing apparatus according to 
claim 5 or 6, wherein 

in the pipeline data processing circuit that 
has received the instruction, when the number of 
Dummy cycles to be inserted that has been added 
to the received instructictn is "1 the data input reg- 
ister receives and stores the data from the outside 
' in a second cycle, while the selector selects the 
operation result stored in the first pipeline register. 

13. A pipeline data processing appai-atus according to 
daim 5 or 6, wherein ' 

in the pipeline* data processiing circuit that 
has received the instruction, when the number of 
Dummy cycles to be inserted that has been added 
to the received instruction is "2", the data input reg- 
ister receives and stores the data from the outside 
in a third cyde, while the selector selects the oper- 
ation result from the data operating circuit. 

14. A pipeline data processing method whereby a plu- 
rality of instructions to perform respective data 
processes are executed by a plurality of pipeline 
data processing circuits, said method comprising 
the steps of: 

simultaneously issuing, when at least two 
data processes havie a data<lependant relationship 
therebetween, instructions to perform the respec- 
tive data processes to the respective pipeline data 
processing circuits; 

then executing first the ant«:edent one of 
said data processes on which the subsequent one 
of said data processes Ms data-dependent by the 
pipeline pfo6essing drcuit in change' of said ante- 
cedent data process; and 

• '* subsequently executing, at a point of time 
"one -cycle after the execution of said antecedent 
^ ' data process on 'which said subsequent data proc- 
ess^ is^diata^ependerit. the subsequent data proc- 

, r-., r- a ... .^..^.a*- 

said subsequent data process. 

15. A pipeline'data processing method whereby a plu- 
rality of -instrijctibns to perform respective data 
process^- are ^ecuted by a 'plurality of pipeline 
data processing drcuits. said method comprising 
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